Protein Secondary Structure Prediction with Hydrophobicity and Hydrophobic Moment
نویسندگان
چکیده
Protein secondary structure prediction has been satisfactorily performed by machine learning techniques such as support vector machines (SVM’s). We discuss a special technique to include hyrophobicity information to further improve the classification results. Hydrophobicity or hydrophobic moment measure of each amino acid is included within a given window length in the protein secondary structure prediction using support vector machines. The input data is divided into two groups, which is subsequently classified by an SVM. By including hydrophobicity or hydrophobic moment, the classification accuracy is increased. Comparing the accuracy between using 1 SVM and 2 SVMs. 2 SVMs method has 3-9% higher accuracy than 1 SVM method. INTRODUCTION Protein structure prediction is a research topic of growing interest. The number of protein sequences deposited in the Protein Data Bank (PDB) grows faster than the numberof known protein structures. It is very time-consuming to crystallize each protein and use X-ray or nuclear magnetic resonance to analyze its structure. Higher accuracy in secondary structure prediction by using machine learning techniques may help predict tertiary structure more precisely. Such research is usually initiated by using basic local alignment search tool (BLAST) or position-specific iterated BLAST (PSIBLAST) to find similar homologous proteins. Then, the sequences are aligned, and the position-specific scoring matrix (PSSM) is calculated. This is typically followed by a machine learning algorithm such as a neural network (NN) or a support vector machine (SVM) to do secondary structure prediction. It is known that discriminating features may help improve classification accuracy. In this paper, we investigate how best to include new features to improve secondary structure prediction. We focus on several membrane proteins which have special distributions with respect to input features. In a previous paper on secondary structure prediction with support vector machines (Ibrikci et al., 2005), the protein structure was classified into 8 classes. In another paper (Rost et al., 2003), the protein structure was classified into 3 classes as alphahelix, beta sheet and coil. In this paper, we first classify the data into 2 classes, such as alpha-helix and non-alpha-helix. Later by combining the results of three binary classifiers, the three-class classifier is obtained. The window size was
منابع مشابه
The Effect of Hydrophobicity and Hydrophilicity of Gold Nanoparticle on Proteins Structure and Function
The surface parameter of nanoparticles such as hydrophobicity and a hydrophilicity on protein structure and function is very important. In this study, conformational changes of glucose oxidase (GOx) in the mercaptopurine: GNPs and 11-mercaptoundecanoic acid: GNPs as a hydrophobic and a hydrophilic GNPs surface was investigated by various spectroscopic techniques, including: UV-Vis absorption, f...
متن کاملThe Effect of Hydrophobicity and Hydrophilicity of Gold Nanoparticle on Proteins Structure and Function
The surface parameter of nanoparticles such as hydrophobicity and a hydrophilicity on protein structure and function is very important. In this study, conformational changes of glucose oxidase (GOx) in the mercaptopurine: GNPs and 11-mercaptoundecanoic acid: GNPs as a hydrophobic and a hydrophilic GNPs surface was investigated by various spectroscopic techniques, including: UV-Vis absorption, f...
متن کاملCloning and characterization of MAP2191 gene, a mammalian cell entry antigen of Mycobacterium avium subspecies paratuberculosis
The aim of this study is to identify, clone and express a Mycobacterium avium subsp. paratuberculosis specific immunogenic antigen candidate, in order to develop better reagents for diagnosis and vaccines for the protection of the host. Therefore, MAP2191 gene (a member of MAPmce5 operon) from MAP, was isolated and characterized by Bioinformatics tools and <e...
متن کاملIn Silico and in Vitroinvestigations on cry4aand cry11atoxins of Bacillus thuringiensis var Israelensis
In the present study we attempted to correlate the structure and function of the cry11a (72 kDa) and cry4a (135 kDa) proteins of Bacillus thuringiensis var israelensis. Homology modeling and secondary structure predictions were done to locate most probable regions for finding helices or strands in these proteins. The JPRED (JPRED consensus secondary structure prediction server) secondary struct...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کامل